Public Transportation BigData Clustering

نویسندگان

  • Tomislav Galba
  • Zoran Balkić
  • Goran Martinović
چکیده

An increase in the use of GPS modules and cell phones with location services has created a need for new ways of collecting and storing data. Considering a fairly large number of devices, data collected in such way in most cases take up a vast amount of space on servers while on the other hand, they represent a source of very useful information. A large number of companies use this method of data collection in order to create prediction models, reports and data analysis. As an object of observation, we use the database of a modern public transportation system which contains information about vehicle telemetry. In this paper, we will describe the application and result analysis of some well-known clustering algorithms in order to solve public transportation problems like traffic congestion, passenger transport, etc.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and Implementation of Good Bibimbap Restaurant Recommendation System Using TCA based on BigData

This paper designs the model of bibimbap restaurant recommendation system in BigData(collected from Twitter's data related to bibimbap). We suggests a TCA (Termite Colony Algorithm) k-means algorithm for clustering BigData, TCA algorithm that used the habits of termites. Through the TCA, finding the appropriate initial clustering needed for the K-means clusters is the goal. We recommend good Bi...

متن کامل

DxNAT - Deep Neural Networks for Explaining Non-Recurring Traffic Congestion

Non-recurring traffic congestion is caused by temporary disruptions, such as accidents, sports games, adverse weather, etc. We use data related to real-time traffic speed, jam factors (a traffic congestion indicator), and events collected over a year from Nashville, TN to train a multi-layered deep neural network. The traffic dataset contains over 900 million data records. The network is therea...

متن کامل

Enhancing Map-Reduce Framework for Bigdata with Hierarchical Clustering

MapReduce is a software framework that allows certain kinds of parallelizable or distributable problems involving large data sets to be solved using computing clusters. This paper introduces our experience of grouping internet users by mining a huge volume of web access log of up to 500 gigabytes. The application is realized using hierarchical clustering algorithms with Map-Reduce, a parallel p...

متن کامل

Analysis of Extended Performance for clustering of Satellite Images Using Bigdata Platform Spark

Due to the recent emergence Clustering techniques have been widely adopted in many real world data analysis applications, such as customer behavior analysis, targeted marketing, digital forensics, etc. As the satellite imagery is getting generated at a higher rate than the previous decades, it becomes essential to have better solutions in terms of accuracy as well as performance. In this paper,...

متن کامل

HIERARCHICAL DATA CLUSTERING MODEL FOR ANALYZING PASSENGERS’ TRIP IN HIGHWAYS

One of the most important issues in urban planning is developing sustainable public transportation. The basic condition for this purpose is analyzing current condition especially based on data. Data mining is a set of new techniques that are beyond statistical data analyzing. Clustering techniques is a subset of it that one of it’s techniques used for analyzing passengers’ trip. The result of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014